Multidimensional evaluation and predicting overall speech quality

نویسندگان

  • Jens Berger
  • Anna Llagostera
چکیده

The quality of speech samples has been traditionally evaluated in subjective listening tests using 5-point Absolute Category Rating (ACR) scales in Listening Only Tests (LOT) as recommended in ITU-T P.800 [1]. Those tests provide the listening quality aspect of speech quality. There are other tests are under discussion and proposed in order to assess in detail individual perceptual dimensions of speech. In this paper we investigate the relationship between the overall listening quality obtained in an ITU-T P.800 ACR subjective test and the rating of the same signals in four dimensions proposed by Wältermann [2], namely noisiness, discontinuity, coloration and loudness. The database we use is composed of conditions and speech signals extracted from an ACR LOT used in the ITU-T P.863 evaluation, processed by simulated and live telecommunication channels [3]. The signals have been re-scored using the four mentioned scales and are foreseen as contribution to the ITU-T P.AMD project. This paper focuses on the modeling of an ACR LOT score based on individual dimensional ratings under the assumption of orthogonality of the four dimensions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Providing a Multidimensional Measurement Model for Assessing Mobile Telecommunication Service Quality (MS-Qual)

Because of the need to develop specific measurement scales for different services industries, this study aimed to empirically develop a reliable and valid model specifically for measuring mobile telecommunication service quality. A multidimensional measurement model (MS-Qual) has been proposed based on an extensive literature review and then, to assess the model validity, convergent and discrim...

متن کامل

Evaluation of objective measures for speech enhancement

In this paper, we evaluate the performance of several objective measures in terms of predicting the quality of noisy speech enhanced by noise suppression algorithms. The objective measures considered a wide range of distortions introduced by four types of real-world noise at two SNRs by four classes of speech enhancement algorithms: spectral subtractive, subspace, statistical-model based and Wi...

متن کامل

Perceptual Dimensions of Wideband-transmitted Speech

In this paper it is analyzed which perceptual dimensions are existent for speech that is transmitted over wideband telephone connections. Therefore, two auditory experiments with subsequent multidimensional analyses (multidimensional scaling and semantic differential) were carried out with a diverse set of mixed narrowband and wideband conditions. This revealed a mapping of the perceptual space...

متن کامل

Perceptual Quality Dimensions of Text-to-Speech Systems

The aim of this paper is to analyze the perceptual quality dimensions of state-of-the-art text-to-speech systems (TTS). Therefore, several pretests were conducted to determine a suitable set of attribute scales. The resulting 16 scales were used in a semantic differential on a diverse database containing 16 different TTS systems. A subsequent multidimensional analysis (Principal Axis Factor ana...

متن کامل

Listeners' weighting of acoustic cues to synthetic speech naturalness: A multidimensional scaling analysis

The quality of current commercial speech synthesis systems is now so high that system improvements are being made at subtle suband supra-segmental levels. Human perceptual evaluation of such subtle improvements requires a highly sophisticated level of perceptual attention to specific acoustic characteristics or cues. However, it is not well understood what acoustic cues listeners attend to by d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015